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METHOD AND SYSTEM FOR INDEXING CACHE SERVER ENTRIES BY MEANS OF UNIFORM 
RESOURCE LOCATOR TRANSFORMATION 

TECHNICAL FIELD OF THE INVENTION 

This invention relates generally to the field of data 
processing and, more particularly, to a method and system 
for uniform resource locator transformation. 

5 

BACKGROUND OF THE INVENTION 

As computers have grown increasingly important in 

today's society, the use of the Internet has also increased. 

The increased usage of the Internet has lead to an decrease 
10 in response time as busy web sites attempt to handle a large 

volume of traffic. 

One solution to handling large amounts of traffic at 

popular web sites is to cache frequently requested content 

from the web site at the user's computer and/or at other 
15 sites . By spreading the content requests around an 

increased number of locations, performance is improved by 

decreasing the load at a single web site. 

One of the limitations to many existing caching 

solutions is the inability to cache content that varies 
2 0 based on information in the request header as opposed to the 

uniform resource locator (URL) indicating the content item. 

For example, a popular site such as Yahoo may provide 

localized versions of web pages based on the language 

information in the header of the hypertext transport 
25 protocol (HTTP) request while using the same URL. 

SUMMARY OF THE INVENTION 

The present invention addresses the problems and 
disadvantages associated with prior systems. In particular. 
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the present invention provides a method and system for 
uniform resource locator transformation. 

One aspect of the invention is a method for 
communicating data comprising receiving a first request at a 
5 cache server. The first request has an associated first 
content item and has an associated header portion. The 
method further includes comparing the first content item and 
the header portion to predefined criteria. In addition, the 
method includes generating a second request based on the 
10 criteria, the header portion and the first content item, • the 
second request being associated with a second content item, 
and retrieving the second content item based on the second 
request . 

Another aspect of the invention is a system for 

15 communicating data comprising a computer readable memory and 
an application stored in the computer readable memory. The 
application is operable to receive a first request at a 
cache server. The first request has an associated first 
content item and has an associated header portion. The 

20 application is further operable to compare the first content 
item and the header portion to predefined criteria. In 
addition, the application is operable to generate a second 
request based on the criteria, the header portion and the 
first content item,, the second request being associated with 

25 a second content item, and retrieve the second content 
item based on the second request . 

The present invention provides important technical 
advantages . Various embodiments of the invention may have 
none, some, or all of these advantages. One such technical 

30 advantage is the capability for transforming the uniform 
resource locators (URLs) portion of a hypertext transport 
protocol (HTTP) request using information in the header 
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portion of the HTTP request. Including header information 
in the URL portion allows for the caching of header 
dependent content items. Thus, a site which returns 
different content based on, for example, language 
5 information in the header portion, using the same URL of, 
for example, "http://www.domain.com/" may now be cached. 

BRIEF DESCRIPTION OF THE DRAWINGS 

For a more complete understanding of the present 
10 invention, the objects > and advantages thereof, reference is 
now made to the following descriptions taken in connection 
with the accompanying drawings in which: 

FIGURE 1 is a block diagram illustrating a uniform 
resource locator transformation system; 
15 FIGURE 2 is a chart illustrating an exemplary 

embodiment of transform criteria used with system of FIGURE 
1 for transforming uniform resource locators; and 

FIGURE 3 is a flow chart illustrating a method of 
transforming URLs using the system of FIGURE 1 . 

20 

DETAILED DESCRIPTION OF THE INVENTION 

A better understanding of the present invention will be 
realized from the detailed description that follows, taken 
in conjunction with the accompanying drawings. 

25 FIGURE 1 is a block diagram illustrating a uniform 

resource locator (URL) transformation system 10. System 10 
comprises a client 12, a network 14, a cache server 16 and 
an origin server 18. System 10 is operable to provide the 
capability to transform URLs used with the hypertext 

3 0 transport protocol (HTTP) and the hypertext markup language 
(HTML) such that information used by a World Wide Web (web) 
server in the HTTP header is included in the uniform 
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resource identifier (URI) portion of an HTTP request. 
Typically, caching systems use the URI portion of an HTTP 
request to identify cached content items. However, caching 
systems are reduced in effectiveness when origin servers 18 
5 provide differing content based on information in the HTTP 
header portion of the HTTP request because the cache system 
caches on the URI, not the header information. System 10 
provides the capability to transform the URI portion of the 
HTTP request to include information from the header portion 

10 of the HTTP request so that caching systems can cache 
header- dependent content items. 

Client 12 comprises a general purpose or specialized 
computing system operable to receive information from a user 
(not shown) , such as a Microsoft Windows based computer, a 

15 UNIX or Linux based computer or a Macintosh computer.- 
Client 12 may also comprise a web browser 20 stored in a 
computer readable memory 22 and executed by a processor 24 . 
Web browser 2 0 comprises any suitable HTTP client and is 
operable to generate one or more HTTP requests 26. Browser 

20 20 communicates with cache server 16 over network 14, and 
transmits and receives data to and from server 16. HTTP 
request 2 6 comprises a URI portion 28 and a header portion 
30. 

In the disclosed embodiment, URI portion 28 comprises a 
25 URL identifying the content requested by browser 20. For 
example, URI portion 28 may comprise 

"http://www.domain.com/index.html". In the disclosed 

embodiment, header portion 30 comprises header information 
included in an HTTP request. For example, header portion 30 
3 0 may comprise "content type = application/x-zip" or "language 
= en-us" . 
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Typically, users of a web browser, such as browser 20, 
identify a desired web page by the URL associated with the 
web page. In addition, a link to a second web page from a 
first web page specifies the second web page by the URL 
5 associated with the second web page. However, when the URL 
seen by the user is communicated to the server hosting the 
desired web page, the URL seen by the user is not the only 
information communicated. More specifically, the URL seen 
by the user forms a portion of the full http request sent to 

10 the web server hosting the desired web page. 

The full http request includes a header, such as header 
portion 30, and the URL seen by the user, such as URI 
portion 28. The header includes various information not 
included in the URL which is useful and sometimes necessary 

15 for proper handling of the http request and proper 
presentation to the user of the content referred to by the 
URL in the http request. The information included in the 
header may include the identity (formally, the "User-Agent") 
of the browser being used, such as Internet Explorer version 

20 4.2 or, more formally, "CERN-Lineiyiode/2 . 15 libwww/2 . 17b3 " . 
The header may also indicate the language (formally 
" Content -Language " ) that the user of the browser prefers, 
such as English or Japanese. The header may also indicate 
various other fields such as cache- control , connection, 

25 date, pragma, trailer, transfer-encoding, upgrade, via, 
warning, allow, content -encoding, content- length, content- 
location, content-md5, content -range, content-type, expires, 
last-modified, extension-header, accept, accept - charset , 
accept-encoding, accept -language, authorization, expect, 

30 from, host, if-match, if -modif ied-since, if -none -match, if- 
range, if -unmodif ied-since, max-f orwards , proxy- 

authorization, range, referer and other header information. 
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For more detailed information regarding http requests and 
http header information, see Request For Comments 2 616 
available at <http://www.w3.org/Protocols/> which is 
incorporated herein by reference. Often, header information 
5 is based on the choices made by a user of a web browser in 
the "Preferences" or "Options" portion of the web browser. 

Many web sites use the header information to select the 
content to present to the user in response to the http 
request. Note that for increased clarity, the following 

10 example may deviate from the formal requirements of an http 
request, however, properly formatted http requests are 
included within the scope of the invention. For example, 
"http://www.server.com/index.html" is a URL requesting the 
"index, html" content at "www.server.com". The URL is part 

15 of a first http request which may also indicate "Content- 
Language = Japanese" in the header portion to indicate that 
the user of the web browser which generated the http request 
prefers content to be presented in the Japanese language. 
When the web server at "www.server.com" receives the above 

20 first http request, the web server may examine both the 
header portion and the URL portion in determining which 
content to present. More specifically, multiple 

"index.html" web pages may be maintained at the web server 
for different languages. Thus, m response to the first 

25 http request, the web server provides the Japanese language 
version of "index.html" based on the information in the 
header portion. Continuing the above example, a second http 
request may have the same URL 

("http://www.server.com/index.html"), but "Content -Language 

30 = English". In response to the second http request, the web 
server may provide an "index.html" page in the English 
language. This example illustrates how a single URL 
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("http://www.server.com/index.html") can be interpreted at a 
web server to return distinct web pages based on information 
in the header portion of the http request. 

One problem experienced by existing caching systems is 
5 the inability to cache content that changes based on header 
information. This problem exists because caching systems 
typically identify cached content using only the URL 
associated with the content, not the header information. 
When "http://www.server.com/index.html" actually identifies 

10 multiple web pages, the cache system is unable to determine 
which of the multiple web pages to return because the header 
information is not included in the URL. The teachings 
provided herein involve a method and system for uniform 
resource locator transformation which would allow a cache 

15 system to cache header-dependent content, such as the 
"http://www.server.com/index.html" content described above. 

Network 14 comprises a data communication system 
operable to communicate data between client 12, cache server 
16 and origin server 18. Network 14 is shown in FIGURE 1 in 

2 0 two parts only to indicate a logical distinction between the 
communications path following between client 12 and cache 
server 16, and cache server 16 and origin server 18. 
However, network 14 may comprise a single network. For 
example, network 14 may be the Internet, an asynchronous 

25 transfer mode ("ATM") network, an Ethernet network, a 
Transmission Control Protocol/Internet Protocol (TCP/IP) 
network, an intranet or any other suitable computer 
networking technologies . For purposes of teaching the 
present invention, an exemplary embodiment will be described 

30 where network 14 comprises the Internet. 

Cache server 16 comprises any suitable combination of 
hardware and/or software operable to cache static and/or 
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dynamic content items. Cache server 16 further comprises 
transform criteria 40. Transform criteria 40 comprises 
rules and directives for transforming XJRLs received from 
clients 12 based on web browser 20, the destination origin 
5 server 18, and other information. For example, transform 
criteria 40 may comprise a table indicating the 
transformations to be applied. Cache server 16 is further 
operable to provide web server capabilities similar to those 
of origin server 18 for handling requests 26. 

10 Origin server 18 communicates with cache server 16 over 

network 14 . Origin server 18 comprises any suitable 
hardware and/or software executing on a computer for 
receiving and responding to requests 26. Origin server 18 
may comprise a single computer executing software or may 

15 comprise a plurality of computers each executing software. 
In the disclosed embodiment, origin server 18 comprises an 
HTTP server which may also be known as a web server. Origin 
server 18 may additionally support other protocols such as 
the file transfer protocol (FTP) . Origin server 18 

2 0 retrieves information from one or more data sources 19 in 
response to requests 26. Origin server 18 is operable to 
retrieve static content, such as prewritten text files, 
images, and web pages, from data sources 19 in response to 
requests 26. Origin server 18 is also operable to generate 

25 new, dynamic content, for example, by dynamically creating 
web pages based on content stored in the data sources 19 in 
response to requests 26. For example, origin server 18 may 
generate a new web page using a common gateway interface 
(CGI) script, generate a new web page from the result of a 

30 structured query language (SQL) request and perform other 
suitable content generation functions. Origin server 18 may 
also be operable to generate executable software, such as 
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applications and applets, in response to requests for data. 
For example, origin server 18 may generate a Java applet in 
response to an appropriate request 26. 

In operation, browser 20 generates request 26 for 
5 content from origin server 18. Cache server 16 intercepts 
request 26 from web browser 20 to origin server 18. For 
example, cache server 16 may intercept request 20 by having 
the domain name service (DNS) server direct request 26 for 
the Internet domain associated with origin server 18 to 

10 cache server 16. Stated another way, requests 26 addressed 
to origin server 18 may be routed to cache server 16 through 
the operation of the DNS server. 

After receiving request 26, cache server 16 examines 
URI portion 28 and header portion 3 0 and compares it to 

15 transform criteria 40. Criteria 40 specifies which requests 
26 to perform transformation on and how to transform 
selected requests 26. Criteria 40 may identify requests 26 
to be transformed based on information in either or both of 
URI portion 28 and header portion 30. More specifically, 

2 0 criteria 40 may indicate that information be added to 
request 2 6 and/or that information be deleted from request 
26. For example, criteria 40 may specify that if 

"www.domain.com" is found in URI portion 28 and "language = 
Japanese" is found in header portion 30, that request 26 be 

25 transformed to include the language information found in 
header portion 3 0 in URI portion 28 so that cache server 16 
can cache the content associated with request 26. For 
another example, criteria 40 may specify that if 
"www. domain. com/acct . cgi?user_id=Mike" is found in URI 

30 portion 28, that request 26 be transformed to remove the 
user_id information from URI portion 28. By removing 
information, user dependent data, such as user identifiers, 
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may be removed from URI portion 28 and allowing more 
efficient caching of web pages. More specifically, by 
removing the user dependent identifier information in the 
above example, only one copy of the page associated with 
5 "www. domain. com/acct . cgi?user_id=Mike" need be cached 
because once the user identifier information is removed, 
only "www.domain.com/acct.cgi" may remain which would 
indicate a single web page instead of multiple web pages. 
In general, any of the elements of header portion 30, such 

10 as cache-control, connection, date, pragma, trailer, 
transfer-encoding, upgrade, via, warning, allow, content- 
encoding, content-length, content-location, content-md5, 
content -range, content-type, expires, last-modified, 
extension -header, accept, accept-charset , accept-encoding, 

15 accept -language, authorization, expect, from, host, if- 
match, if -modif ied-since, if -none -match, if-range, if- 
unmodified- since, max-f orwards, proxy- authorization, range, 
referer [sic] and other header fields, may be associated 
with any portion, substring or other part of URI portion 28 

20 to indicate how to transform requests 26. Further details 
of criteria 40 are discussed in association with FIGURE 2. 
Typically, criteria 40 for a particular origin server 18 
will be configured using information from an administrator 
associated with the origin server 18 as the administrator 

25 will know which particular web pages and other content are 
identified by a single URL, but depend on header 
information. 

After request 2 6 has been transformed according to 
transform criteria 40, cache server 16 examines URI portion 
30 2 8 to determine whether the requested content has been 
cached at cache server 16. Since URI portion 28 has been 
transformed, any header information needed to identify the 
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particular content to be retrieved is now available in URI 
portion 28. Cache server 16 can then uniquely identify 
content that is header dependent using only the URI portion 
28 to identify the content. For example, a initial URI 
5 portion 28 of "http : //www. server . com/ index. html " and a 
header portion 3 0 of "Language = Japanese" may have been 
transformed into a new URI portion 2 8 of 
"http : //www. server . com/ index. html ?Language= Japanese" to 
identify the Japanese language version of the 

10 "http://www.server.com/index.html" web page. Continuing the 
above example, a further request 26 with an initial URI 
portion 2 8 of "http://www.server.com/index.html" and a 
header portion 3 0 of "Language = English" may have been 
transformed into a new URI portion 2 8 of 

15 "http://www. server. com/index. html?Language=English" to 
identify the English language version of the 
"http://www.server.com/index.html" web page. This example 
illustrates how a single URL 

"http://www.server.com/index.html" which refers to multiple, 

2 0 language dependent web pages, may be transformed to 
distinctly identify the multiple web pages for caching 
purposes . 

If the requested content has been cached at cache 
server 16, cache server 16 responds to request 26 with the 

2 5 cached copy of the requested content. If the requested 
content is not cached at cache server 16, cache server 16 
communicates the untransf ormed version of request 2 6 to 
origin server 18. Cache server 16 next receives the 
requested information from origin server 18. Cache server 

30 16 then transforms request 26 so as to be able to cache the 
requested content at cache server 16 and to do so 
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transparently to origin server 18. Cache server 16 then 
cotntnunicates the requested content to web browser 20. 

FIGURE 2 is a table illustrating an exemplary table for 
transform criteria 40, however, any other suitable 
5 implementation for criteria 40 may also be used. Transform 
criteria 40 may comprise match criteria 100 and an 
associated transform 102. Match criteria 100 comprises the 
information in URI portion 28 and header portion 3 0 that 
cache server 16 uses to determine which transform criteria 

10 40 to apply to request 26. Match criteria 100 may comprise 
a URL such as "www.domain.com", which would match that 
specific machine and domain regardless of the content 
requested from that domain. Match criteria 100 may also 
comprise a partial domain name such as ". domain . com" which 

15 would match any machine name within domain.com. Match 
criteria 100 may also utilize wild cards such as and 
"?". More specifically, the "*" wildcard may indicate any 
number of any character while the "?" wildcard may indicate 
a single occurrence of any character, other wildcards may 

20 also be used. For example, " . *domain. com" would match any 
machine in any domain ending with "domain.com" where * 
represents any number of characters preceding domain.com, 
thus, "www.mydomain.com" would be matched. For another 
example, ".domain.*" would match any machine in any generic 

25 top level domain (gTLD) with a domain of "domain." For 
further example, ".domain*.*" would match any machine name 
in a domain which started with "domain" and ended with any 
number of characters in any gTLD, thus, 
"www.domainofmine.com" would be matched, but not 

30 "www.mydomain.com". For yet another example, ".domainP.com" 
would match any machine name in any domain starting with the 
word "domain" and having a single additional character after 
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the word "domain", thus, "www.domainl.cotn" would be matched, 
but not "www.domain22.com". Match criteria 100 may also use 
wild cards in the content portion of the URL. For example, 
" .domain.com/data/car* .html" would match any machine name in 
5 "domain.com" which requested HTML content items in the data 
directory which started with the word "car" and followed by 
any number of characters, thus, 

"www.domain.com/data/car_porche.html" would be matched. In 
general, any suitable wildcard scheme, such as regular 

10 expression matching, may be used with match criteria 100. 

Criteria 40 may also be used to indicate 
transformations based on the referring web page. Thus, 
differing content may be provided to users based on how the 
user arrived at origin server 18. When a referring page 

15 includes referral information in URI portion 28 and/or in 
header portion 30, criteria 4 0 may indicate a transformation 
such that differing content is provided based on the 
referring page. For example, a request 26 with URI portion 
of "http://www.bookstore.com/index.html" and a header 

2 0 portion 3 0 including "referer = 

http://www.preferredsite.com" may be transformed to 
"http : / /www. bookstore . com/pref erredcustomer_index. html " . 
Request 2 6 may be transformed such that users who arrive at 
"www.bookstore.com" from "www.preferredsite.com" receive 
25 preferential treatment because of, for example, a 
partnership between "www.bookstore.com" and 

"www.preferredsite.com". In addition, since the "referer" 
header field may be set regardless of the contents of URI 
portion 28, web sites can indicate actions to take based on 

3 0 the referring site regardless of the referring site's 

assistance. In contrast to existing referral programs which 
typically require the referring site to include identifying 
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information in the URL, by using information in the 
referring field of header portion 30, the referring site can 
be identified without participation on the part of the 
referring site. Further, if a web site wishes to exclude 
5 visitors from a particular referring site, the 
transformation specified in criteria 40 for a particular URI 
portion 28 and a particular referring site in header portion 
30 may direct the user to an access denied page so that 
users from that particular referring site may be excluded, 

10 Transform 102 comprises an indication of what action to 

take for the particular match criteria 100. Each transform 
102 is associated with one or more match criteria 100. 
Transform 102 may also indicate multiple actions to be 
taken. For example, transform 102 may indicate that data in 

15 the "language field" be appended to the end of the URI 
portion 28. For another example, the "browser type" 
information in header 30 may be inserted in URI portion 28. 
Transform 102 may also indicate information to be removed 
from the matched URL. For example, transform 102 may 

20 indicate that a "user_id" element of URI portion 28 be 
removed . 

FIGURE 3 is a flow chart illustrating a method for 
transforming requests 26 at cache server 16. The method 
begins at step 200 where request 26 is received at cache 
25 server 16. Next, at step 202, cache server 16 examines 
request 26 against match criteria 100 in transform criteria 
40 . 

Then, at decisional step 204, cache server 16 
determines whether request 26 should be changed. More 
30 specifically, if URI portion 28 and/or header portion 30 of 
request 26 matches any one or more of match criteria 100, 
cache server 16 will apply transforms 102 associated with 
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the matched element of match criteria 100. For example, a 
request 26 with a URI portion 28 including 
"server.domain.com" may match the second exemplary entry of 
match criteria 100 (FIGURE 2) . Once a match has been found 
5 between one or more entries of match criteria 100 and 
request 2 8 of request 26 then server 16 will transform 
request 26. More specifically, if a match is found between 
request 26 and match criteria 100 then the YES branch of 
decisional step 204 leads to step 206. At step 206, cache 

10 server 16 transforms URI portion 28 of request 26 according 
to each match criteria 100 matched in step 2 04 according to 
the transform 102 associated with the matched elements of 
match criteria 100. If no matches are found between URI 
portion 28 and match criteria 100 in step 204 then the NO 

15 branch of decisional step 204 leads to decisional step 208. 

At decisional step 208, URI portion 28, which may have 
been transformed in step 2 06, is used to determine whether 
the content requested by request 26 is cached at cache 
server 16. If the requested content is not cached at cache 

20 server 16 then the NO branch of decisional step 208 leads to 
step 210. At step 210, cache server 16 forwards the 
untransformed version of request 26 to origin server 18 and 
retrieves the requested content. Next, at step 212, the 
requested content item is cached at cache server 16, if the 

25 requested content item is cacheable. As some content items 
are uncacheable or have been specified as uncacheable, the 
content item may not be cached. At step 214 the requested 
content item is returned to client 12 . If the requested 
item is cached at cache server 16 then the YES branch of 

30 decisional step 208 is followed to step 214. 

System 10 provides the capability to transform URLs so 
that header- dependent content may be cached by at a cache 
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server. Existing web content caching systems which rely on 
the URL associated with the web content have previously 
lacked the capability to cache content which varied based on 
header information in the http request associated with the 
5 URL. System 10 provides the capability to transform the URL 
associated with header-dependent content such that caching 
systems can uniquely identify the header-dependent content 
using the associated URL. 

In addition, system 10 may be used to provide 

10 additional functionality to cache servers and other systems. 
For example, the ability to include the URL of a referring 
site in a request's URL without the assistance of the 
referring site provides the capability to site 
administrators to exclude visitors from particular sites. 

15 Other changes, substitutions and alterations are also 

possible without departing from the spirit and scope of the 
present invention, as defined by the following claims. 
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WHAT IS CLAIMED IS: 

1. A method for communicating data comprising: 
receiving a first request at a cache server, the first 

request having an associated first content item and having 
5 an associated header portion; 

comparing the first content item and the header portion 
to predefined criteria; 

generating a second request based on the criteria, the 
header portion and the first content item, the second 
10 request being associated with a second content item; and 

retrieving the second content item based on the second 
request . 

2 . The method for communicating data according to 
15 Claim 1, wherein the header portion comprises a hypertext 

transport protocol header portion and wherein comparing the 
request comprises : 

examining a hypertext transport protocol identifier 
portion associated with the first content item; 
2 0 comparing the hypertext transport protocol identifier 

portion to the criteria; 

examining the hypertext transport protocol header 
portion associated with the first request; and 

comparing the hypertext transport protocol header 
25 portion to the criteria. 

3 . The method for communicating data according to 

Claim 2, wherein the predefined criteria comprises a match 
criteria and an associated transform. 

30 
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4 . The method for communicating data according to 
Claim 3 , wherein the transform comprises at least one rule 
indicating how to modify the hypertext transport protocol 
identifier portion associated with the first request to 

5 generate the second request . 

5. The method for communicating data according to 
Claim 3, wherein the transform comprises at least one rule 
indicating an element associated with the hypertext 

10 transport protocol header portion of the first request to be 
associated with the hypertext transport protocol identifier 
portion of the second request. 

6. The method for communicating data according to 
15 Claim 3, wherein the match criteria comprises at least one 

entry, each entry comprising a portion of a hypertext 

transport protocol identifier and comparing the hypertext 

transport protocol identifier portion to the criteria 

comprises comparing each entry to the hypertext transport 
2 0 protocol identifier portion of the first request. 

7. The method for communicating data according to 
Claim 1, wherein retrieving the second content item 
comprises: 

25 retrieving the second content item based on the second 

request from the cache server when the second content item 
is available from the cache server; and 

retrieving the first content item based on the first 
request from the origin server when the second content item 

30 is unavailable from the cache server. 
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8 . The method for communicating data according to 
Claim 7, wherein the second content item is related to the 
first content item. 



5 9. The method for communicating data according to 

Claim 7, wherein the second content item comprises a version 
of the first content item customized in response to data in 
the header portion associated with the first request. 



10 10. The method for communicating data according to 

Claim 1, wherein generating the second request comprises: 

adding a hypertext transport protocol identifier 
portion of the first request to a hypertext transport 
protocol identifier portion of the second request; and 

15 associating an element associated with the header 

portion associated with the first request with the hypertext 
transport protocol identifier portion of the second request. 

11. A system for communicating data comprising: 

2 0 a computer readable .memory; 

an application stored in the computer readable memory 
and operable to: 

receive a first request at a cache server, the 
first request having an associated first content item and 
25 having an associated header portion; 

compare the first content item and the header 
portion to predefined criteria; 

generate a second request based on the criteria, 
the header portion and the first content item, the second 

3 0 request being associated with a second content item; and 

retrieve the second content item based on the 
second request . 
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12 . The system for commionicating data according to 
Claim 11, wherein the header portion comprises a hypertext 
transport protocol header portion and wherein the 
application is further operable to: 

5 examine a hypertext transport protocol identifier 

portion associated with the first request; 

compare the hypertext transport protocol identifier 
portion to the criteria; 

examine the hypertext transport protocol header portion 
10 associated with the first request; and 

compare the hypertext transport protocol header portion 
to the criteria. 

13 . The system for communicating data according to 
15 Claim 12, wherein the predefined criteria comprises a match 

criteria and an associated transform. 

14. The system for communicating data according to 
Claim 13, wherein the transform comprises at least one rule 

20 indicating how to modify the hypertext transport protocol 
identifier portion associated with the first request to 
generate the second request . 

15 . The system for communicating data according to 
25 Claim 13, wherein the transform comprises at least one rule 

indicating an element associated with the hypertext 
transport protocol header portion of the first request to be 
associated with the hypertext transport protocol identifier 
portion of the second request. 

30 
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16. The system for conununicating data according to 
Claim 13, wherein the match criteria comprises at least one 
entry, each entry comprising a portion of a hypertext 
transport protocol identifier and comparing the hypertext 

5 transport protocol identifier portion to the criteria 

comprises comparing each entry to the hypertext transport 
protocol identifier portion of the first request. 

17. The system for communicating data according to 
10 Claim 11, wherein the application is further operable to: 

retrieve the second content item based on the second 
request from the cache server when the second content item 
is available from the cache server; and 

retrieve the first content item based on the first 
15 request from the origin server when the second content item 
is unavailable from the cache server. 

18. The system for communicating data according to 
Claim 17, wherein the second content item is related to the 

2 0 first content item. 

19 . The system for communicating data according to 
Claim 17, wherein the second content item comprises a 
version of the first content item customized in response to 

25 data in the header portion associated with the first 
request . 
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20. The system for communicating data according to 
Claim 11, wherein the application is further operable to: 

add a hypertext transport protocol identifier portion 
of the first request to a hypertext transport protocol 
5 identifier portion of the second request; and 

associate an element associated with the header portion 
associated with the first request with the hypertext 
transport protocol identifier portion of the second request. 
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